Overview

Brought to you by YData

Dataset statistics

Number of variables19
Number of observations19,768
Missing cells10,929
Missing cells (%)2.9%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.1 MiB
Average record size in memory110.0 B

Variable types

Categorical1
Boolean6
Text1
Numeric11

Alerts

created_at is highly overall correlated with public_gists and 1 other fieldsHigh correlation
followers is highly overall correlated with followers_log and 6 other fieldsHigh correlation
followers_log is highly overall correlated with followers and 6 other fieldsHigh correlation
following is highly overall correlated with followers and 4 other fieldsHigh correlation
following_log is highly overall correlated with followers and 4 other fieldsHigh correlation
label is highly overall correlated with text_bot_countHigh correlation
public_gists is highly overall correlated with created_at and 5 other fieldsHigh correlation
public_gists_log is highly overall correlated with created_at and 5 other fieldsHigh correlation
public_repos is highly overall correlated with followers and 6 other fieldsHigh correlation
public_repos_log is highly overall correlated with followers and 6 other fieldsHigh correlation
text_bot_count is highly overall correlated with label and 1 other fieldsHigh correlation
type is highly overall correlated with text_bot_count and 1 other fieldsHigh correlation
updated_at is highly overall correlated with typeHigh correlation
label is highly imbalanced (67.2%) Imbalance
type is highly imbalanced (92.8%) Imbalance
site_admin is highly imbalanced (95.8%) Imbalance
bio has 10929 (55.3%) missing values Missing
public_repos is highly skewed (γ1 = 53.8847472) Skewed
public_gists is highly skewed (γ1 = 74.09063706) Skewed
followers is highly skewed (γ1 = 32.46602776) Skewed
following is highly skewed (γ1 = 39.87415424) Skewed
public_repos has 942 (4.8%) zeros Zeros
public_gists has 7961 (40.3%) zeros Zeros
followers has 1445 (7.3%) zeros Zeros
following has 6017 (30.4%) zeros Zeros
text_bot_count has 19003 (96.1%) zeros Zeros
public_repos_log has 942 (4.8%) zeros Zeros
public_gists_log has 7961 (40.3%) zeros Zeros
followers_log has 1445 (7.3%) zeros Zeros
following_log has 6017 (30.4%) zeros Zeros

Reproduction

Analysis started2024-11-27 03:58:32.755339
Analysis finished2024-11-27 03:58:43.288582
Duration10.53 seconds
Software versionydata-profiling vv4.12.0
Download configurationconfig.json

Variables

label
Categorical

High correlation  Imbalance 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size154.6 KiB
Human
18578 
Bot
 
1190

Length

Max length5
Median length5
Mean length4.8796034
Min length3

Characters and Unicode

Total characters96,460
Distinct characters8
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowHuman
2nd rowHuman
3rd rowHuman
4th rowBot
5th rowHuman

Common Values

ValueCountFrequency (%)
Human 18578
94.0%
Bot 1190
 
6.0%

Length

2024-11-27T11:58:43.378188image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-11-27T11:58:43.446190image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
ValueCountFrequency (%)
human 18578
94.0%
bot 1190
 
6.0%

Most occurring characters

ValueCountFrequency (%)
H 18578
19.3%
u 18578
19.3%
m 18578
19.3%
a 18578
19.3%
n 18578
19.3%
B 1190
 
1.2%
o 1190
 
1.2%
t 1190
 
1.2%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 76692
79.5%
Uppercase Letter 19768
 
20.5%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
u 18578
24.2%
m 18578
24.2%
a 18578
24.2%
n 18578
24.2%
o 1190
 
1.6%
t 1190
 
1.6%
Uppercase Letter
ValueCountFrequency (%)
H 18578
94.0%
B 1190
 
6.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 96460
100.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
H 18578
19.3%
u 18578
19.3%
m 18578
19.3%
a 18578
19.3%
n 18578
19.3%
B 1190
 
1.2%
o 1190
 
1.2%
t 1190
 
1.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 96460
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
H 18578
19.3%
u 18578
19.3%
m 18578
19.3%
a 18578
19.3%
n 18578
19.3%
B 1190
 
1.2%
o 1190
 
1.2%
t 1190
 
1.2%

type
Boolean

High correlation  Imbalance 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size19.4 KiB
True
19597 
False
 
171
ValueCountFrequency (%)
True 19597
99.1%
False 171
 
0.9%
2024-11-27T11:58:43.501368image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/

site_admin
Boolean

Imbalance 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size19.4 KiB
False
19678 
True
 
90
ValueCountFrequency (%)
False 19678
99.5%
True 90
 
0.5%
2024-11-27T11:58:43.554188image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/

company
Boolean

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size19.4 KiB
True
10794 
False
8974 
ValueCountFrequency (%)
True 10794
54.6%
False 8974
45.4%
2024-11-27T11:58:43.609407image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/

blog
Boolean

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size19.4 KiB
False
11256 
True
8512 
ValueCountFrequency (%)
False 11256
56.9%
True 8512
43.1%
2024-11-27T11:58:43.665744image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/

location
Boolean

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size19.4 KiB
True
12691 
False
7077 
ValueCountFrequency (%)
True 12691
64.2%
False 7077
35.8%
2024-11-27T11:58:43.723546image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/

hireable
Boolean

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size19.4 KiB
False
16470 
True
3298 
ValueCountFrequency (%)
False 16470
83.3%
True 3298
 
16.7%
2024-11-27T11:58:43.780282image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/

bio
Text

Missing 

Distinct8641
Distinct (%)97.8%
Missing10929
Missing (%)55.3%
Memory size154.6 KiB
2024-11-27T11:58:43.970238image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/

Length

Max length160
Median length116
Mean length61.460459
Min length1

Characters and Unicode

Total characters543,249
Distinct characters1,746
Distinct categories23 ?
Distinct scripts18 ?
Distinct blocks45 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique8,574 ?
Unique (%)97.0%

Sample

1st rowI just press the buttons randomly, and the program evolves...
2nd rowTime is unimportant, only life important.
3rd rowDone studying. Need challenges.
4th rowAdministrator of MOONGIFT that is introducing open source software everyday to Japanese engineers since 2004.
5th rowSenior Software Engineer at Google, working on Certificate Transparency and generalized transparency.
ValueCountFrequency (%)
3069
 
3.9%
and 2526
 
3.2%
engineer 1583
 
2.0%
software 1521
 
1.9%
of 1488
 
1.9%
at 1380
 
1.8%
developer 1236
 
1.6%
the 1086
 
1.4%
a 1038
 
1.3%
i 1033
 
1.3%
Other values (14754) 62407
79.6%
2024-11-27T11:58:44.286226image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
70014
 
12.9%
e 49589
 
9.1%
o 32360
 
6.0%
n 31402
 
5.8%
a 31366
 
5.8%
t 31195
 
5.7%
r 31181
 
5.7%
i 28526
 
5.3%
s 19655
 
3.6%
l 14767
 
2.7%
Other values (1736) 203194
37.4%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 388595
71.5%
Space Separator 70192
 
12.9%
Uppercase Letter 43745
 
8.1%
Other Punctuation 23761
 
4.4%
Decimal Number 3557
 
0.7%
Control 2914
 
0.5%
Dash Punctuation 2560
 
0.5%
Other Letter 2016
 
0.4%
Other Symbol 2014
 
0.4%
Math Symbol 1750
 
0.3%
Other values (13) 2145
 
0.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
25
 
1.2%
20
 
1.0%
20
 
1.0%
14
 
0.7%
13
 
0.6%
13
 
0.6%
12
 
0.6%
11
 
0.5%
11
 
0.5%
11
 
0.5%
Other values (912) 1866
92.6%
Other Symbol
ValueCountFrequency (%)
141
 
7.0%
💻 86
 
4.3%
🍕 81
 
4.0%
71
 
3.5%
62
 
3.1%
👨 58
 
2.9%
58
 
2.9%
🚀 46
 
2.3%
🐁 39
 
1.9%
39
 
1.9%
Other values (429) 1333
66.2%
Lowercase Letter
ValueCountFrequency (%)
e 49589
12.8%
o 32360
 
8.3%
n 31402
 
8.1%
a 31366
 
8.1%
t 31195
 
8.0%
r 31181
 
8.0%
i 28526
 
7.3%
s 19655
 
5.1%
l 14767
 
3.8%
c 14228
 
3.7%
Other values (137) 104326
26.8%
Nonspacing Mark
ValueCountFrequency (%)
204
60.7%
̶ 10
 
3.0%
͡ 6
 
1.8%
̭ 6
 
1.8%
̯ 6
 
1.8%
͉ 6
 
1.8%
́ 5
 
1.5%
̘ 4
 
1.2%
̪ 4
 
1.2%
͜ 4
 
1.2%
Other values (45) 81
 
24.1%
Uppercase Letter
ValueCountFrequency (%)
S 5685
13.0%
C 3807
 
8.7%
E 3010
 
6.9%
I 2927
 
6.7%
P 2841
 
6.5%
D 2744
 
6.3%
A 2743
 
6.3%
M 2331
 
5.3%
T 2121
 
4.8%
F 1734
 
4.0%
Other values (34) 13802
31.6%
Other Punctuation
ValueCountFrequency (%)
. 7699
32.4%
, 5911
24.9%
@ 4168
17.5%
/ 2005
 
8.4%
: 865
 
3.6%
' 750
 
3.2%
& 663
 
2.8%
! 383
 
1.6%
# 310
 
1.3%
221
 
0.9%
Other values (24) 786
 
3.3%
Math Symbol
ValueCountFrequency (%)
| 1137
65.0%
+ 407
 
23.3%
> 70
 
4.0%
= 43
 
2.5%
< 39
 
2.2%
~ 26
 
1.5%
8
 
0.5%
4
 
0.2%
3
 
0.2%
2
 
0.1%
Other values (10) 11
 
0.6%
Decimal Number
ValueCountFrequency (%)
2 659
18.5%
0 591
16.6%
1 581
16.3%
3 361
10.1%
9 268
7.5%
8 240
 
6.7%
6 234
 
6.6%
4 224
 
6.3%
5 216
 
6.1%
7 179
 
5.0%
Other values (3) 4
 
0.1%
Close Punctuation
ValueCountFrequency (%)
) 570
84.9%
] 58
 
8.6%
} 18
 
2.7%
9
 
1.3%
5
 
0.7%
4
 
0.6%
3
 
0.4%
2
 
0.3%
1
 
0.1%
1
 
0.1%
Open Punctuation
ValueCountFrequency (%)
( 536
85.1%
[ 57
 
9.0%
{ 19
 
3.0%
7
 
1.1%
3
 
0.5%
2
 
0.3%
2
 
0.3%
2
 
0.3%
1
 
0.2%
1
 
0.2%
Modifier Symbol
ValueCountFrequency (%)
🏻 30
34.5%
¯ 16
18.4%
` 14
16.1%
🏽 10
 
11.5%
🏼 9
 
10.3%
^ 3
 
3.4%
🏾 3
 
3.4%
2
 
2.3%
Private Use
ValueCountFrequency (%)
6
40.0%
2
 
13.3%
2
 
13.3%
2
 
13.3%
1
 
6.7%
1
 
6.7%
1
 
6.7%
Space Separator
ValueCountFrequency (%)
70014
99.7%
  61
 
0.1%
48
 
0.1%
  40
 
0.1%
27
 
< 0.1%
2
 
< 0.1%
Modifier Letter
ValueCountFrequency (%)
10
62.5%
ˌ 2
 
12.5%
ˈ 2
 
12.5%
1
 
6.2%
ː 1
 
6.2%
Other Number
ValueCountFrequency (%)
² 2
33.3%
1
16.7%
¹ 1
16.7%
1
16.7%
¼ 1
16.7%
Dash Punctuation
ValueCountFrequency (%)
- 2511
98.1%
30
 
1.2%
18
 
0.7%
1
 
< 0.1%
Format
ValueCountFrequency (%)
142
96.6%
2
 
1.4%
­ 2
 
1.4%
1
 
0.7%
Final Punctuation
ValueCountFrequency (%)
29
60.4%
14
29.2%
» 5
 
10.4%
Currency Symbol
ValueCountFrequency (%)
$ 13
68.4%
5
 
26.3%
£ 1
 
5.3%
Initial Punctuation
ValueCountFrequency (%)
12
70.6%
4
 
23.5%
« 1
 
5.9%
Connector Punctuation
ValueCountFrequency (%)
_ 147
96.7%
5
 
3.3%
Control
ValueCountFrequency (%)
2914
100.0%
Letter Number
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 432048
79.5%
Common 108419
 
20.0%
Han 1521
 
0.3%
Inherited 478
 
0.1%
Cyrillic 244
 
< 0.1%
Hangul 174
 
< 0.1%
Hiragana 155
 
< 0.1%
Katakana 79
 
< 0.1%
Arabic 67
 
< 0.1%
Greek 26
 
< 0.1%
Other values (8) 38
 
< 0.1%

Most frequent character per script

Han
ValueCountFrequency (%)
25
 
1.6%
20
 
1.3%
20
 
1.3%
14
 
0.9%
13
 
0.9%
13
 
0.9%
12
 
0.8%
11
 
0.7%
11
 
0.7%
11
 
0.7%
Other values (680) 1371
90.1%
Common
ValueCountFrequency (%)
70014
64.6%
. 7699
 
7.1%
, 5911
 
5.5%
@ 4168
 
3.8%
2914
 
2.7%
- 2511
 
2.3%
/ 2005
 
1.8%
| 1137
 
1.0%
: 865
 
0.8%
' 750
 
0.7%
Other values (574) 10445
 
9.6%
Latin
ValueCountFrequency (%)
e 49589
 
11.5%
o 32360
 
7.5%
n 31402
 
7.3%
a 31366
 
7.3%
t 31195
 
7.2%
r 31181
 
7.2%
i 28526
 
6.6%
s 19655
 
4.5%
l 14767
 
3.4%
c 14228
 
3.3%
Other values (107) 147779
34.2%
Hangul
ValueCountFrequency (%)
8
 
4.6%
7
 
4.0%
7
 
4.0%
5
 
2.9%
4
 
2.3%
4
 
2.3%
4
 
2.3%
4
 
2.3%
4
 
2.3%
3
 
1.7%
Other values (102) 124
71.3%
Inherited
ValueCountFrequency (%)
204
42.7%
142
29.7%
̶ 10
 
2.1%
͡ 6
 
1.3%
̭ 6
 
1.3%
̯ 6
 
1.3%
͉ 6
 
1.3%
́ 5
 
1.0%
̘ 4
 
0.8%
̪ 4
 
0.8%
Other values (46) 85
17.8%
Katakana
ValueCountFrequency (%)
10
 
12.7%
4
 
5.1%
4
 
5.1%
3
 
3.8%
3
 
3.8%
3
 
3.8%
3
 
3.8%
3
 
3.8%
3
 
3.8%
2
 
2.5%
Other values (32) 41
51.9%
Cyrillic
ValueCountFrequency (%)
а 27
 
11.1%
о 18
 
7.4%
т 18
 
7.4%
н 14
 
5.7%
е 13
 
5.3%
и 12
 
4.9%
в 12
 
4.9%
с 11
 
4.5%
у 9
 
3.7%
р 8
 
3.3%
Other values (31) 102
41.8%
Hiragana
ValueCountFrequency (%)
11
 
7.1%
11
 
7.1%
8
 
5.2%
7
 
4.5%
7
 
4.5%
7
 
4.5%
7
 
4.5%
6
 
3.9%
6
 
3.9%
6
 
3.9%
Other values (30) 79
51.0%
Arabic
ValueCountFrequency (%)
ا 10
14.9%
م 8
11.9%
و 7
10.4%
ت 6
 
9.0%
ل 5
 
7.5%
ر 4
 
6.0%
ع 4
 
6.0%
ة 3
 
4.5%
ي 3
 
4.5%
ى 2
 
3.0%
Other values (12) 15
22.4%
Greek
ValueCountFrequency (%)
ω 4
15.4%
λ 3
11.5%
π 2
 
7.7%
ς 2
 
7.7%
ρ 2
 
7.7%
θ 2
 
7.7%
ν 1
 
3.8%
Θ 1
 
3.8%
ο 1
 
3.8%
δ 1
 
3.8%
Other values (7) 7
26.9%
Hebrew
ValueCountFrequency (%)
מ 2
14.3%
ר 2
14.3%
ש 2
14.3%
ו 1
7.1%
א 1
7.1%
ל 1
7.1%
ע 1
7.1%
ה 1
7.1%
ח 1
7.1%
י 1
7.1%
Unknown
ValueCountFrequency (%)
6
40.0%
2
 
13.3%
2
 
13.3%
2
 
13.3%
1
 
6.7%
1
 
6.7%
1
 
6.7%
Tibetan
ValueCountFrequency (%)
1
50.0%
1
50.0%
Thai
ValueCountFrequency (%)
2
100.0%
Kannada
ValueCountFrequency (%)
2
100.0%
Mandaic
ValueCountFrequency (%)
1
100.0%
Egyptian_Hieroglyphs
ValueCountFrequency (%)
𓀡 1
100.0%
Devanagari
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 537268
98.9%
None 1839
 
0.3%
CJK 1521
 
0.3%
Punctuation 576
 
0.1%
Block Elements 255
 
< 0.1%
Cyrillic 244
 
< 0.1%
VS 205
 
< 0.1%
Enclosed Alphanum Sup 181
 
< 0.1%
Hangul 165
 
< 0.1%
Dingbats 160
 
< 0.1%
Other values (35) 835
 
0.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
70014
 
13.0%
e 49589
 
9.2%
o 32360
 
6.0%
n 31402
 
5.8%
a 31366
 
5.8%
t 31195
 
5.8%
r 31181
 
5.8%
i 28526
 
5.3%
s 19655
 
3.7%
l 14767
 
2.7%
Other values (86) 197213
36.7%
Punctuation
ValueCountFrequency (%)
221
38.4%
142
24.7%
48
 
8.3%
30
 
5.2%
29
 
5.0%
27
 
4.7%
18
 
3.1%
14
 
2.4%
12
 
2.1%
12
 
2.1%
Other values (10) 23
 
4.0%
VS
ValueCountFrequency (%)
204
99.5%
1
 
0.5%
Block Elements
ValueCountFrequency (%)
141
55.3%
62
24.3%
39
 
15.3%
11
 
4.3%
1
 
0.4%
1
 
0.4%
None
ValueCountFrequency (%)
💻 86
 
4.7%
🍕 81
 
4.4%
73
 
4.0%
  61
 
3.3%
👨 58
 
3.2%
55
 
3.0%
· 54
 
2.9%
🚀 46
 
2.5%
  40
 
2.2%
🐁 39
 
2.1%
Other values (407) 1246
67.8%
Dingbats
ValueCountFrequency (%)
71
44.4%
58
36.2%
5
 
3.1%
5
 
3.1%
3
 
1.9%
2
 
1.2%
2
 
1.2%
2
 
1.2%
1
 
0.6%
1
 
0.6%
Other values (10) 10
 
6.2%
Enclosed Alphanum Sup
ValueCountFrequency (%)
🇦 31
17.1%
🇺 29
16.0%
🇧 17
9.4%
🇷 15
8.3%
🇨 15
8.3%
🇬 11
 
6.1%
🇪 8
 
4.4%
🇸 8
 
4.4%
🇾 6
 
3.3%
🇮 6
 
3.3%
Other values (13) 35
19.3%
Misc Symbols
ValueCountFrequency (%)
30
21.9%
21
15.3%
14
10.2%
12
 
8.8%
9
 
6.6%
5
 
3.6%
4
 
2.9%
4
 
2.9%
3
 
2.2%
3
 
2.2%
Other values (24) 32
23.4%
Cyrillic
ValueCountFrequency (%)
а 27
 
11.1%
о 18
 
7.4%
т 18
 
7.4%
н 14
 
5.7%
е 13
 
5.3%
и 12
 
4.9%
в 12
 
4.9%
с 11
 
4.5%
у 9
 
3.7%
р 8
 
3.3%
Other values (31) 102
41.8%
CJK
ValueCountFrequency (%)
25
 
1.6%
20
 
1.3%
20
 
1.3%
14
 
0.9%
13
 
0.9%
13
 
0.9%
12
 
0.8%
11
 
0.7%
11
 
0.7%
11
 
0.7%
Other values (680) 1371
90.1%
Hiragana
ValueCountFrequency (%)
11
 
7.1%
11
 
7.1%
8
 
5.2%
7
 
4.5%
7
 
4.5%
7
 
4.5%
7
 
4.5%
6
 
3.9%
6
 
3.9%
6
 
3.9%
Other values (30) 79
51.0%
Katakana
ValueCountFrequency (%)
10
 
11.9%
10
 
11.9%
5
 
6.0%
4
 
4.8%
4
 
4.8%
3
 
3.6%
3
 
3.6%
3
 
3.6%
3
 
3.6%
3
 
3.6%
Other values (27) 36
42.9%
Arabic
ValueCountFrequency (%)
ا 10
14.7%
م 8
11.8%
و 7
10.3%
ت 6
 
8.8%
ل 5
 
7.4%
ر 4
 
5.9%
ع 4
 
5.9%
ة 3
 
4.4%
ي 3
 
4.4%
ى 2
 
2.9%
Other values (13) 16
23.5%
Diacriticals
ValueCountFrequency (%)
̶ 10
 
7.8%
͡ 6
 
4.7%
̭ 6
 
4.7%
̯ 6
 
4.7%
͉ 6
 
4.7%
́ 5
 
3.9%
̘ 4
 
3.1%
̪ 4
 
3.1%
͜ 4
 
3.1%
̩ 4
 
3.1%
Other values (40) 73
57.0%
Compat Jamo
ValueCountFrequency (%)
8
100.0%
Geometric Shapes Ext
ValueCountFrequency (%)
🟦 8
57.1%
🟨 6
42.9%
Arrows
ValueCountFrequency (%)
8
53.3%
4
26.7%
3
 
20.0%
Emoticons
ValueCountFrequency (%)
🙈 7
 
11.9%
🙉 6
 
10.2%
😄 4
 
6.8%
🙂 4
 
6.8%
😎 3
 
5.1%
😉 2
 
3.4%
🙏 2
 
3.4%
😍 2
 
3.4%
😁 2
 
3.4%
😋 2
 
3.4%
Other values (18) 25
42.4%
Hangul
ValueCountFrequency (%)
7
 
4.2%
7
 
4.2%
5
 
3.0%
4
 
2.4%
4
 
2.4%
4
 
2.4%
4
 
2.4%
4
 
2.4%
3
 
1.8%
3
 
1.8%
Other values (100) 120
72.7%
Geometric Shapes
ValueCountFrequency (%)
6
31.6%
2
 
10.5%
2
 
10.5%
2
 
10.5%
2
 
10.5%
2
 
10.5%
1
 
5.3%
1
 
5.3%
1
 
5.3%
Box Drawing
ValueCountFrequency (%)
6
28.6%
6
28.6%
5
23.8%
3
14.3%
1
 
4.8%
PUA
ValueCountFrequency (%)
6
40.0%
2
 
13.3%
2
 
13.3%
2
 
13.3%
1
 
6.7%
1
 
6.7%
1
 
6.7%
Letterlike Symbols
ValueCountFrequency (%)
5
100.0%
Currency Symbols
ValueCountFrequency (%)
5
100.0%
Sup Punctuation
ValueCountFrequency (%)
4
100.0%
Math Operators
ValueCountFrequency (%)
3
23.1%
2
15.4%
2
15.4%
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
Hebrew
ValueCountFrequency (%)
מ 2
14.3%
ר 2
14.3%
ש 2
14.3%
ו 1
7.1%
א 1
7.1%
ל 1
7.1%
ע 1
7.1%
ה 1
7.1%
ח 1
7.1%
י 1
7.1%
Misc Technical
ValueCountFrequency (%)
2
28.6%
1
14.3%
1
14.3%
1
14.3%
1
14.3%
1
14.3%
IPA Ext
ValueCountFrequency (%)
ʖ 2
20.0%
ʕ 1
10.0%
ʔ 1
10.0%
ʀ 1
10.0%
ɴ 1
10.0%
ɾ 1
10.0%
ɚ 1
10.0%
ɹ 1
10.0%
ɛ 1
10.0%
Math Alphanum
ValueCountFrequency (%)
𝘵 2
 
8.3%
𝘴 2
 
8.3%
𝘭 2
 
8.3%
𝘶 2
 
8.3%
𝒾 1
 
4.2%
𝒽 1
 
4.2%
𝐂 1
 
4.2%
𝐑 1
 
4.2%
𝟎 1
 
4.2%
𝟖 1
 
4.2%
Other values (10) 10
41.7%
Phonetic Ext
ValueCountFrequency (%)
2
33.3%
1
16.7%
1
16.7%
1
16.7%
1
16.7%
Thai
ValueCountFrequency (%)
2
100.0%
CJK Compat Forms
ValueCountFrequency (%)
2
100.0%
Modifier Letters
ValueCountFrequency (%)
ˌ 2
40.0%
ˈ 2
40.0%
ː 1
20.0%
Kannada
ValueCountFrequency (%)
2
100.0%
Diacriticals Sup
ValueCountFrequency (%)
1
50.0%
1
50.0%
Mandaic
ValueCountFrequency (%)
1
100.0%
Mahjong
ValueCountFrequency (%)
🀄 1
100.0%
Jamo
ValueCountFrequency (%)
1
100.0%
Number Forms
ValueCountFrequency (%)
1
100.0%
Latin Ext Additional
ValueCountFrequency (%)
1
50.0%
1
50.0%
Egyptian Hieroglyphs
ValueCountFrequency (%)
𓀡 1
100.0%
Enclosed Alphanum
ValueCountFrequency (%)
1
100.0%
Tibetan
ValueCountFrequency (%)
1
50.0%
1
50.0%
Devanagari
ValueCountFrequency (%)
1
100.0%

public_repos
Real number (ℝ)

High correlation  Skewed  Zeros 

Distinct674
Distinct (%)3.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean84.139215
Minimum0
Maximum50000
Zeros942
Zeros (%)4.8%
Negative0
Negative (%)0.0%
Memory size154.6 KiB
2024-11-27T11:58:44.394207image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile1
Q111
median35
Q383
95-th percentile250
Maximum50000
Range50000
Interquartile range (IQR)72

Descriptive statistics

Standard deviation574.75022
Coefficient of variation (CV)6.8309434
Kurtosis3700.1203
Mean84.139215
Median Absolute Deviation (MAD)29
Skewness53.884747
Sum1663264
Variance330337.81
MonotonicityNot monotonic
2024-11-27T11:58:44.475598image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 942
 
4.8%
1 551
 
2.8%
2 465
 
2.4%
3 396
 
2.0%
4 380
 
1.9%
6 364
 
1.8%
5 357
 
1.8%
7 330
 
1.7%
9 312
 
1.6%
8 307
 
1.6%
Other values (664) 15364
77.7%
ValueCountFrequency (%)
0 942
4.8%
1 551
2.8%
2 465
2.4%
3 396
2.0%
4 380
1.9%
5 357
 
1.8%
6 364
 
1.8%
7 330
 
1.7%
8 307
 
1.6%
9 312
 
1.6%
ValueCountFrequency (%)
50000 1
< 0.1%
27746 1
< 0.1%
26360 1
< 0.1%
22618 1
< 0.1%
20693 1
< 0.1%
17425 1
< 0.1%
16985 1
< 0.1%
16839 1
< 0.1%
9666 1
< 0.1%
9554 1
< 0.1%

public_gists
Real number (ℝ)

High correlation  Skewed  Zeros 

Distinct359
Distinct (%)1.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean25.214083
Minimum0
Maximum55781
Zeros7961
Zeros (%)40.3%
Negative0
Negative (%)0.0%
Memory size154.6 KiB
2024-11-27T11:58:44.557642image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median2
Q310
95-th percentile66
Maximum55781
Range55781
Interquartile range (IQR)10

Descriptive statistics

Standard deviation635.69014
Coefficient of variation (CV)25.211709
Kurtosis5955.7935
Mean25.214083
Median Absolute Deviation (MAD)2
Skewness74.090637
Sum498432
Variance404101.96
MonotonicityNot monotonic
2024-11-27T11:58:44.643090image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 7961
40.3%
1 1873
 
9.5%
2 1152
 
5.8%
3 823
 
4.2%
4 665
 
3.4%
5 627
 
3.2%
6 488
 
2.5%
7 405
 
2.0%
9 327
 
1.7%
8 318
 
1.6%
Other values (349) 5129
25.9%
ValueCountFrequency (%)
0 7961
40.3%
1 1873
 
9.5%
2 1152
 
5.8%
3 823
 
4.2%
4 665
 
3.4%
5 627
 
3.2%
6 488
 
2.5%
7 405
 
2.0%
8 318
 
1.6%
9 327
 
1.7%
ValueCountFrequency (%)
55781 1
< 0.1%
53660 1
< 0.1%
28943 1
< 0.1%
26879 1
< 0.1%
15482 1
< 0.1%
10604 1
< 0.1%
3450 1
< 0.1%
3170 1
< 0.1%
2565 1
< 0.1%
1750 1
< 0.1%

followers
Real number (ℝ)

High correlation  Skewed  Zeros 

Distinct1598
Distinct (%)8.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean245.49702
Minimum0
Maximum95752
Zeros1445
Zeros (%)7.3%
Negative0
Negative (%)0.0%
Memory size154.6 KiB
2024-11-27T11:58:44.735823image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q17
median33
Q3125
95-th percentile836
Maximum95752
Range95752
Interquartile range (IQR)118

Descriptive statistics

Standard deviation1535.94
Coefficient of variation (CV)6.2564506
Kurtosis1570.3008
Mean245.49702
Median Absolute Deviation (MAD)31
Skewness32.466028
Sum4852985
Variance2359111.6
MonotonicityNot monotonic
2024-11-27T11:58:44.828873image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 1445
 
7.3%
1 803
 
4.1%
2 623
 
3.2%
3 515
 
2.6%
4 450
 
2.3%
5 415
 
2.1%
6 396
 
2.0%
7 347
 
1.8%
8 338
 
1.7%
9 311
 
1.6%
Other values (1588) 14125
71.5%
ValueCountFrequency (%)
0 1445
7.3%
1 803
4.1%
2 623
3.2%
3 515
 
2.6%
4 450
 
2.3%
5 415
 
2.1%
6 396
 
2.0%
7 347
 
1.8%
8 338
 
1.7%
9 311
 
1.6%
ValueCountFrequency (%)
95752 1
< 0.1%
84979 1
< 0.1%
66203 1
< 0.1%
58452 1
< 0.1%
31120 1
< 0.1%
30287 1
< 0.1%
29719 1
< 0.1%
29414 1
< 0.1%
28411 1
< 0.1%
25815 1
< 0.1%

following
Real number (ℝ)

High correlation  Skewed  Zeros 

Distinct620
Distinct (%)3.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean44.520741
Minimum0
Maximum27775
Zeros6017
Zeros (%)30.4%
Negative0
Negative (%)0.0%
Memory size154.6 KiB
2024-11-27T11:58:44.911160image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median4
Q322
95-th percentile148
Maximum27775
Range27775
Interquartile range (IQR)22

Descriptive statistics

Standard deviation366.79344
Coefficient of variation (CV)8.2387093
Kurtosis2260.6155
Mean44.520741
Median Absolute Deviation (MAD)4
Skewness39.874154
Sum880086
Variance134537.43
MonotonicityNot monotonic
2024-11-27T11:58:44.994503image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 6017
30.4%
1 1734
 
8.8%
2 1092
 
5.5%
3 794
 
4.0%
4 602
 
3.0%
5 533
 
2.7%
6 484
 
2.4%
7 407
 
2.1%
8 368
 
1.9%
9 322
 
1.6%
Other values (610) 7415
37.5%
ValueCountFrequency (%)
0 6017
30.4%
1 1734
 
8.8%
2 1092
 
5.5%
3 794
 
4.0%
4 602
 
3.0%
5 533
 
2.7%
6 484
 
2.4%
7 407
 
2.1%
8 368
 
1.9%
9 322
 
1.6%
ValueCountFrequency (%)
27775 1
< 0.1%
16741 1
< 0.1%
15931 1
< 0.1%
11921 1
< 0.1%
10268 1
< 0.1%
9720 1
< 0.1%
9686 1
< 0.1%
9532 1
< 0.1%
9367 1
< 0.1%
7374 1
< 0.1%

created_at
Real number (ℝ)

High correlation 

Distinct19767
Distinct (%)> 99.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1.4145822 × 109
Minimum1.2014178 × 109
Maximum1.6399782 × 109
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size154.6 KiB
2024-11-27T11:58:45.085271image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/

Quantile statistics

Minimum1.2014178 × 109
5-th percentile1.2494106 × 109
Q11.3331265 × 109
median1.4031753 × 109
Q31.4978973 × 109
95-th percentile1.5973464 × 109
Maximum1.6399782 × 109
Range4.3856039 × 108
Interquartile range (IQR)1.6477081 × 108

Descriptive statistics

Standard deviation1.0675296 × 108
Coefficient of variation (CV)0.075466074
Kurtosis-0.89541378
Mean1.4145822 × 109
Median Absolute Deviation (MAD)80133068
Skewness0.18356714
Sum2.7963461 × 1013
Variance1.1396195 × 1016
MonotonicityNot monotonic
2024-11-27T11:58:45.170907image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1247664042 2
 
< 0.1%
1317058023 1
 
< 0.1%
1417180054 1
 
< 0.1%
1341720219 1
 
< 0.1%
1348424237 1
 
< 0.1%
1271356600 1
 
< 0.1%
1336009883 1
 
< 0.1%
1301066311 1
 
< 0.1%
1287340808 1
 
< 0.1%
1326725682 1
 
< 0.1%
Other values (19757) 19757
99.9%
ValueCountFrequency (%)
1201417787 1
< 0.1%
1201596634 1
< 0.1%
1202080438 1
< 0.1%
1202170661 1
< 0.1%
1202362686 1
< 0.1%
1202770412 1
< 0.1%
1202891072 1
< 0.1%
1202925333 1
< 0.1%
1202925642 1
< 0.1%
1202927153 1
< 0.1%
ValueCountFrequency (%)
1639978181 1
< 0.1%
1639063718 1
< 0.1%
1638745117 1
< 0.1%
1638268856 1
< 0.1%
1637693729 1
< 0.1%
1637692402 1
< 0.1%
1637686847 1
< 0.1%
1637587854 1
< 0.1%
1637513272 1
< 0.1%
1637212252 1
< 0.1%

updated_at
Real number (ℝ)

High correlation 

Distinct19633
Distinct (%)99.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1.6896785 × 109
Minimum1.4706947 × 109
Maximum1.697294 × 109
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size154.6 KiB
2024-11-27T11:58:45.255507image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/

Quantile statistics

Minimum1.4706947 × 109
5-th percentile1.6595533 × 109
Q11.6915382 × 109
median1.6954963 × 109
Q31.6966558 × 109
95-th percentile1.6971782 × 109
Maximum1.697294 × 109
Range2.2659934 × 108
Interquartile range (IQR)5117602.8

Descriptive statistics

Standard deviation18095580
Coefficient of variation (CV)0.010709481
Kurtosis35.311546
Mean1.6896785 × 109
Median Absolute Deviation (MAD)1477031
Skewness-5.2026243
Sum3.3401565 × 1013
Variance3.2745002 × 1014
MonotonicityNot monotonic
2024-11-27T11:58:45.344401image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1697023264 3
 
< 0.1%
1697023072 3
 
< 0.1%
1695900030 3
 
< 0.1%
1695900245 3
 
< 0.1%
1695468012 3
 
< 0.1%
1697023048 3
 
< 0.1%
1696356868 3
 
< 0.1%
1695899963 2
 
< 0.1%
1696591930 2
 
< 0.1%
1696418321 2
 
< 0.1%
Other values (19623) 19741
99.9%
ValueCountFrequency (%)
1470694689 1
< 0.1%
1497465171 1
< 0.1%
1497465692 1
< 0.1%
1497469793 1
< 0.1%
1497470617 1
< 0.1%
1497470746 1
< 0.1%
1497471211 1
< 0.1%
1497471213 1
< 0.1%
1497471722 1
< 0.1%
1499154173 1
< 0.1%
ValueCountFrequency (%)
1697294028 1
< 0.1%
1697290182 1
< 0.1%
1697288420 1
< 0.1%
1697288225 1
< 0.1%
1697288102 1
< 0.1%
1697287820 1
< 0.1%
1697287405 1
< 0.1%
1697287346 1
< 0.1%
1697287260 1
< 0.1%
1697287051 1
< 0.1%

text_bot_count
Real number (ℝ)

High correlation  Zeros 

Distinct6
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.061361797
Minimum0
Maximum5
Zeros19003
Zeros (%)96.1%
Negative0
Negative (%)0.0%
Memory size154.6 KiB
2024-11-27T11:58:45.420341image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile0
Maximum5
Range5
Interquartile range (IQR)0

Descriptive statistics

Standard deviation0.34100309
Coefficient of variation (CV)5.5572539
Kurtosis51.672415
Mean0.061361797
Median Absolute Deviation (MAD)0
Skewness6.674794
Sum1213
Variance0.11628311
MonotonicityNot monotonic
2024-11-27T11:58:45.490078image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
Histogram with fixed size bins (bins=6)
ValueCountFrequency (%)
0 19003
96.1%
1 425
 
2.1%
2 251
 
1.3%
3 75
 
0.4%
4 9
 
< 0.1%
5 5
 
< 0.1%
ValueCountFrequency (%)
0 19003
96.1%
1 425
 
2.1%
2 251
 
1.3%
3 75
 
0.4%
4 9
 
< 0.1%
5 5
 
< 0.1%
ValueCountFrequency (%)
5 5
 
< 0.1%
4 9
 
< 0.1%
3 75
 
0.4%
2 251
 
1.3%
1 425
 
2.1%
0 19003
96.1%

public_repos_log
Real number (ℝ)

High correlation  Zeros 

Distinct674
Distinct (%)3.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3.3934449
Minimum0
Maximum10.819798
Zeros942
Zeros (%)4.8%
Negative0
Negative (%)0.0%
Memory size154.6 KiB
2024-11-27T11:58:45.574006image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0.69314718
Q12.4849066
median3.5835189
Q34.4308168
95-th percentile5.5254529
Maximum10.819798
Range10.819798
Interquartile range (IQR)1.9459101

Descriptive statistics

Standard deviation1.4801216
Coefficient of variation (CV)0.4361708
Kurtosis0.063401207
Mean3.3934449
Median Absolute Deviation (MAD)0.94446161
Skewness-0.38244821
Sum67081.618
Variance2.1907598
MonotonicityNot monotonic
2024-11-27T11:58:45.660058image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 942
 
4.8%
0.6931471806 551
 
2.8%
1.098612289 465
 
2.4%
1.386294361 396
 
2.0%
1.609437912 380
 
1.9%
1.945910149 364
 
1.8%
1.791759469 357
 
1.8%
2.079441542 330
 
1.7%
2.302585093 312
 
1.6%
2.197224577 307
 
1.6%
Other values (664) 15364
77.7%
ValueCountFrequency (%)
0 942
4.8%
0.6931471806 551
2.8%
1.098612289 465
2.4%
1.386294361 396
2.0%
1.609437912 380
1.9%
1.791759469 357
 
1.8%
1.945910149 364
 
1.8%
2.079441542 330
 
1.7%
2.197224577 307
 
1.6%
2.302585093 312
 
1.6%
ValueCountFrequency (%)
10.81979828 1
< 0.1%
10.23088301 1
< 0.1%
10.17964092 1
< 0.1%
10.02654554 1
< 0.1%
9.937599082 1
< 0.1%
9.765718623 1
< 0.1%
9.740144754 1
< 0.1%
9.731512288 1
< 0.1%
9.176473302 1
< 0.1%
9.164819857 1
< 0.1%

public_gists_log
Real number (ℝ)

High correlation  Zeros 

Distinct359
Distinct (%)1.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1.3667909
Minimum0
Maximum10.929207
Zeros7961
Zeros (%)40.3%
Negative0
Negative (%)0.0%
Memory size154.6 KiB
2024-11-27T11:58:46.010674image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median1.0986123
Q32.3978953
95-th percentile4.2046926
Maximum10.929207
Range10.929207
Interquartile range (IQR)2.3978953

Descriptive statistics

Standard deviation1.4937885
Coefficient of variation (CV)1.0929166
Kurtosis0.26107473
Mean1.3667909
Median Absolute Deviation (MAD)1.0986123
Skewness0.93069164
Sum27018.723
Variance2.231404
MonotonicityNot monotonic
2024-11-27T11:58:46.093687image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 7961
40.3%
0.6931471806 1873
 
9.5%
1.098612289 1152
 
5.8%
1.386294361 823
 
4.2%
1.609437912 665
 
3.4%
1.791759469 627
 
3.2%
1.945910149 488
 
2.5%
2.079441542 405
 
2.0%
2.302585093 327
 
1.7%
2.197224577 318
 
1.6%
Other values (349) 5129
25.9%
ValueCountFrequency (%)
0 7961
40.3%
0.6931471806 1873
 
9.5%
1.098612289 1152
 
5.8%
1.386294361 823
 
4.2%
1.609437912 665
 
3.4%
1.791759469 627
 
3.2%
1.945910149 488
 
2.5%
2.079441542 405
 
2.0%
2.197224577 318
 
1.6%
2.302585093 327
 
1.7%
ValueCountFrequency (%)
10.92920652 1
< 0.1%
10.89044176 1
< 0.1%
10.27311821 1
< 0.1%
10.19913779 1
< 0.1%
9.647497927 1
< 0.1%
9.269080867 1
< 0.1%
8.146419323 1
< 0.1%
8.061802275 1
< 0.1%
7.850103545 1
< 0.1%
7.467942332 1
< 0.1%

followers_log
Real number (ℝ)

High correlation  Zeros 

Distinct1598
Distinct (%)8.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3.5025164
Minimum0
Maximum11.469527
Zeros1445
Zeros (%)7.3%
Negative0
Negative (%)0.0%
Memory size154.6 KiB
2024-11-27T11:58:46.176639image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q12.0794415
median3.5263605
Q34.8362819
95-th percentile6.7298241
Maximum11.469527
Range11.469527
Interquartile range (IQR)2.7568404

Descriptive statistics

Standard deviation1.9557633
Coefficient of variation (CV)0.55838804
Kurtosis-0.29389155
Mean3.5025164
Median Absolute Deviation (MAD)1.3291359
Skewness0.12973968
Sum69237.744
Variance3.8250099
MonotonicityNot monotonic
2024-11-27T11:58:46.270653image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 1445
 
7.3%
0.6931471806 803
 
4.1%
1.098612289 623
 
3.2%
1.386294361 515
 
2.6%
1.609437912 450
 
2.3%
1.791759469 415
 
2.1%
1.945910149 396
 
2.0%
2.079441542 347
 
1.8%
2.197224577 338
 
1.7%
2.302585093 311
 
1.6%
Other values (1588) 14125
71.5%
ValueCountFrequency (%)
0 1445
7.3%
0.6931471806 803
4.1%
1.098612289 623
3.2%
1.386294361 515
 
2.6%
1.609437912 450
 
2.3%
1.791759469 415
 
2.1%
1.945910149 396
 
2.0%
2.079441542 347
 
1.8%
2.197224577 338
 
1.7%
2.302585093 311
 
1.6%
ValueCountFrequency (%)
11.46952724 1
< 0.1%
11.35017121 1
< 0.1%
11.10049616 1
< 0.1%
10.97597829 1
< 0.1%
10.34563811 1
< 0.1%
10.31850687 1
< 0.1%
10.2995755 1
< 0.1%
10.28926003 1
< 0.1%
10.25456687 1
< 0.1%
10.15874973 1
< 0.1%

following_log
Real number (ℝ)

High correlation  Zeros 

Distinct620
Distinct (%)3.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1.8589591
Minimum0
Maximum10.231928
Zeros6017
Zeros (%)30.4%
Negative0
Negative (%)0.0%
Memory size154.6 KiB
2024-11-27T11:58:46.367481image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median1.6094379
Q33.1354942
95-th percentile5.0039463
Maximum10.231928
Range10.231928
Interquartile range (IQR)3.1354942

Descriptive statistics

Standard deviation1.743082
Coefficient of variation (CV)0.93766562
Kurtosis-0.25441172
Mean1.8589591
Median Absolute Deviation (MAD)1.6094379
Skewness0.68128993
Sum36747.903
Variance3.0383349
MonotonicityNot monotonic
2024-11-27T11:58:46.459423image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 6017
30.4%
0.6931471806 1734
 
8.8%
1.098612289 1092
 
5.5%
1.386294361 794
 
4.0%
1.609437912 602
 
3.0%
1.791759469 533
 
2.7%
1.945910149 484
 
2.4%
2.079441542 407
 
2.1%
2.197224577 368
 
1.9%
2.302585093 322
 
1.6%
Other values (610) 7415
37.5%
ValueCountFrequency (%)
0 6017
30.4%
0.6931471806 1734
 
8.8%
1.098612289 1092
 
5.5%
1.386294361 794
 
4.0%
1.609437912 602
 
3.0%
1.791759469 533
 
2.7%
1.945910149 484
 
2.4%
2.079441542 407
 
2.1%
2.197224577 368
 
1.9%
2.302585093 322
 
1.6%
ValueCountFrequency (%)
10.23192762 1
< 0.1%
9.725675811 1
< 0.1%
9.676084944 1
< 0.1%
9.386140712 1
< 0.1%
9.236884927 1
< 0.1%
9.182043773 1
< 0.1%
9.178540059 1
< 0.1%
9.162514742 1
< 0.1%
9.145054905 1
< 0.1%
8.905851181 1
< 0.1%

Interactions

2024-11-27T11:58:42.313137image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-11-27T11:58:34.493036image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-11-27T11:58:35.521065image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-11-27T11:58:36.270706image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-11-27T11:58:36.972044image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-11-27T11:58:37.828966image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-11-27T11:58:38.482828image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-11-27T11:58:39.224317image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-11-27T11:58:40.027845image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-11-27T11:58:40.828298image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-11-27T11:58:41.659165image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-11-27T11:58:42.369952image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-11-27T11:58:34.759722image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-11-27T11:58:35.595880image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-11-27T11:58:36.331483image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-11-27T11:58:37.052393image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-11-27T11:58:37.887385image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-11-27T11:58:38.541118image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-11-27T11:58:39.295380image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-11-27T11:58:40.107701image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-11-27T11:58:40.889161image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-11-27T11:58:41.721178image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-11-27T11:58:42.428010image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-11-27T11:58:34.838814image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-11-27T11:58:35.674585image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-11-27T11:58:36.390430image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-11-27T11:58:37.122936image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-11-27T11:58:37.947381image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-11-27T11:58:38.598970image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-11-27T11:58:39.368886image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-11-27T11:58:40.184259image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-11-27T11:58:40.945250image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-11-27T11:58:41.787709image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-11-27T11:58:42.485600image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-11-27T11:58:34.916229image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-11-27T11:58:35.752957image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-11-27T11:58:36.447888image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-11-27T11:58:37.180564image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-11-27T11:58:38.005020image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-11-27T11:58:38.658106image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-11-27T11:58:39.436995image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-11-27T11:58:40.249275image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-11-27T11:58:41.003077image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-11-27T11:58:41.844728image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-11-27T11:58:42.546403image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-11-27T11:58:34.992398image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-11-27T11:58:35.838065image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-11-27T11:58:36.505453image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-11-27T11:58:37.238120image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-11-27T11:58:38.062128image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-11-27T11:58:38.719425image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-11-27T11:58:39.509145image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-11-27T11:58:40.317933image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-11-27T11:58:41.060390image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-11-27T11:58:41.904908image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-11-27T11:58:42.603294image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-11-27T11:58:35.074844image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-11-27T11:58:35.901629image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-11-27T11:58:36.563638image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-11-27T11:58:37.299707image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-11-27T11:58:38.118281image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-11-27T11:58:38.782469image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-11-27T11:58:39.589530image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-11-27T11:58:40.395927image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-11-27T11:58:41.118453image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-11-27T11:58:41.960598image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-11-27T11:58:42.662644image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-11-27T11:58:35.144847image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-11-27T11:58:35.959322image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-11-27T11:58:36.617845image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-11-27T11:58:37.360622image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-11-27T11:58:38.175660image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-11-27T11:58:38.849693image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-11-27T11:58:39.666896image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-11-27T11:58:40.463811image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-11-27T11:58:41.178356image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-11-27T11:58:42.020116image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-11-27T11:58:42.720758image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-11-27T11:58:35.218791image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-11-27T11:58:36.018521image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-11-27T11:58:36.677670image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-11-27T11:58:37.423568image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-11-27T11:58:38.243920image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-11-27T11:58:38.918840image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-11-27T11:58:39.736444image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-11-27T11:58:40.538924image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-11-27T11:58:41.234289image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-11-27T11:58:42.076777image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-11-27T11:58:42.777269image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-11-27T11:58:35.290437image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-11-27T11:58:36.077889image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-11-27T11:58:36.743401image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-11-27T11:58:37.481520image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-11-27T11:58:38.308895image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-11-27T11:58:38.993535image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-11-27T11:58:39.803069image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-11-27T11:58:40.613708image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-11-27T11:58:41.292805image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-11-27T11:58:42.135243image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-11-27T11:58:42.837917image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-11-27T11:58:35.371559image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-11-27T11:58:36.140850image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-11-27T11:58:36.824997image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-11-27T11:58:37.542518image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-11-27T11:58:38.366482image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-11-27T11:58:39.068752image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-11-27T11:58:39.874371image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-11-27T11:58:40.694239image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-11-27T11:58:41.350211image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-11-27T11:58:42.196598image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-11-27T11:58:42.900073image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-11-27T11:58:35.447679image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-11-27T11:58:36.205801image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-11-27T11:58:36.904012image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-11-27T11:58:37.603050image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-11-27T11:58:38.424313image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-11-27T11:58:39.148555image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-11-27T11:58:39.952943image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-11-27T11:58:40.763256image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-11-27T11:58:41.600506image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
2024-11-27T11:58:42.255604image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/

Correlations

2024-11-27T11:58:46.532483image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
blogcompanycreated_atfollowersfollowers_logfollowingfollowing_loghireablelabellocationpublic_gistspublic_gists_logpublic_repospublic_repos_logsite_admintext_bot_counttypeupdated_at
blog1.0000.2580.2870.0460.4270.0220.3590.2180.0240.3690.0120.3570.0000.3650.0050.0620.0800.156
company0.2581.0000.1700.0180.2590.0050.1960.0570.0700.3920.0000.1800.0100.1980.0250.0690.1020.139
created_at0.2870.1701.000-0.499-0.499-0.260-0.2600.1080.1110.266-0.529-0.529-0.463-0.4630.0220.1430.148-0.180
followers0.0460.018-0.4991.0001.0000.5370.5370.0000.0000.0200.5970.5970.6510.6510.000-0.1500.0000.296
followers_log0.4270.259-0.4991.0001.0000.5370.5370.2140.1630.3980.5970.5970.6510.6510.079-0.1500.2260.296
following0.0220.005-0.2600.5370.5371.0001.0000.0410.0000.0000.4380.4380.5370.5370.000-0.1590.0000.265
following_log0.3590.196-0.2600.5370.5371.0001.0000.2700.1650.3590.4380.4380.5370.5370.000-0.1590.1140.265
hireable0.2180.0570.1080.0000.2140.0410.2701.0000.0580.1780.0000.1990.0180.2320.0130.0490.0400.103
label0.0240.0700.1110.0000.1630.0000.1650.0581.0000.1300.0410.1410.0180.3690.0060.5790.3680.410
location0.3690.3920.2660.0200.3980.0000.3590.1780.1301.0000.0000.2880.0000.3570.0190.1310.1240.214
public_gists0.0120.000-0.5290.5970.5970.4380.4380.0000.0410.0001.0001.0000.6360.6360.000-0.1370.0000.248
public_gists_log0.3570.180-0.5290.5970.5970.4380.4380.1990.1410.2881.0001.0000.6360.6360.026-0.1370.0910.248
public_repos0.0000.010-0.4630.6510.6510.5370.5370.0180.0180.0000.6360.6361.0001.0000.000-0.2040.0000.304
public_repos_log0.3650.198-0.4630.6510.6510.5370.5370.2320.3690.3570.6360.6361.0001.0000.022-0.2040.3260.304
site_admin0.0050.0250.0220.0000.0790.0000.0000.0130.0060.0190.0000.0260.0000.0221.0000.0000.0000.000
text_bot_count0.0620.0690.143-0.150-0.150-0.159-0.1590.0490.5790.131-0.137-0.137-0.204-0.2040.0001.0000.510-0.174
type0.0800.1020.1480.0000.2260.0000.1140.0400.3680.1240.0000.0910.0000.3260.0000.5101.0000.801
updated_at0.1560.139-0.1800.2960.2960.2650.2650.1030.4100.2140.2480.2480.3040.3040.000-0.1740.8011.000

Missing values

2024-11-27T11:58:42.990588image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
A simple visualization of nullity by column.
2024-11-27T11:58:43.174117image/svg+xmlMatplotlib v3.8.3, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

labeltypesite_admincompanybloglocationhireablebiopublic_repospublic_gistsfollowersfollowingcreated_atupdated_attext_bot_countpublic_repos_logpublic_gists_logfollowers_logfollowing_log
0HumanTrueFalseFalseFalseFalseFalseNaN261511317058023169719607003.2958370.6931471.7917590.693147
1HumanTrueFalseFalseTrueFalseTrueI just press the buttons randomly, and the program evolves...303961435572766169665997403.4339871.3862942.3025851.945910
2HumanTrueFalseTrueTrueTrueTrueTime is unimportant,\nonly life important.1034912122211220026803169621268104.6443913.9120237.1008525.402677
3BotTrueFalseFalseFalseTrueFalseNaN4908421400611389169711529903.9120230.0000004.4426511.098612
4HumanTrueFalseFalseFalseFalseTrueNaN111621345126753169659352102.4849070.6931471.9459101.098612
5HumanTrueFalseTrueTrueTrueFalseDone studying. Need challenges.5612271491919687169700396604.0430510.6931473.1354942.079442
6HumanTrueFalseTrueTrueTrueTrueAdministrator of MOONGIFT that is introducing open source software everyday to Japanese engineers since 2004.277113963161207606942169580549605.6276217.0387844.1588832.833213
7HumanTrueFalseTrueFalseTrueFalseSenior Software Engineer at Google, working on Certificate Transparency and generalized transparency.3712201327010227169142439403.6375860.6931473.1354940.000000
8HumanTrueFalseFalseFalseFalseFalseNaN272375961577217873169711170103.3322051.0986123.6375866.391917
9HumanTrueFalseTrueTrueTrueFalseHi4291421374622174169688442503.7612002.3025852.7080501.098612
labeltypesite_admincompanybloglocationhireablebiopublic_repospublic_gistsfollowersfollowingcreated_atupdated_attext_bot_countpublic_repos_logpublic_gists_logfollowers_logfollowing_log
19758HumanTrueFalseTrueFalseTrueFalseNaN30010111473500700169659185103.4339870.0000002.3978952.484907
19759HumanTrueFalseFalseFalseTrueTrueNaN37199161334806034169670243203.6375862.9957324.5217891.945910
19760BotTrueFalseFalseFalseFalseFalseI am the bot account of @alvaroaleman10001544903731162739526520.6931470.0000000.0000000.000000
19761HumanTrueFalseFalseFalseFalseFalseNaN30101384099537169349196821.3862940.0000000.6931470.000000
19762HumanTrueFalseFalseFalseFalseFalseNaN00001601577032160927111200.0000000.0000000.0000000.000000
19763BotTrueFalseTrueTrueTrueFalseTony came to Linux in 1994 and has never looked back. His entire professional career has been spent working with or on Linux. First as a systems administrator36161141404343654169211751403.6109182.8332132.4849071.609438
19764HumanTrueFalseFalseFalseFalseFalseNaN160301512597391169039634502.8332130.0000001.3862940.000000
19765HumanTrueFalseTrueFalseTrueFalseSoftware engineer at RealTracs.1301011447512245166128898902.6390570.0000002.3978950.693147
19766HumanTrueFalseTrueFalseFalseFalseNaN70201637693729169663264502.0794420.0000001.0986120.000000
19767BotTrueFalseFalseFalseTrueFalseNaN100101461363119165722330102.3978950.0000000.6931470.000000